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Some proof theoretical remarks 
on quantification in ordinary language 



Abstract: This paper surveys the common approach to quantification and 
generalised quantification in formal linguistics and philosophy of language. We 
point out how this general setting departs from empirical linguistic data, and 
give some hints for a different view based on proof theory, which on many aspects 
gets closer to the language itself. We stress the importance of Hilbert's oper- 
ator epsilon and tau for, respectively, existential and universal quantifications. 
Indeed, these operators help a lot to construct semantic representation close 
to natural language, in particular with quantified noun phrases as individual 
terms. We also define guidelines for the design of the proof rules corresponding 
to generalised quantifiers. 

Resume Cet article dresse un rapide panorama de l'approche commune de 
la quantification generalisee ou non en linguistique formelle et en philosophic 
du langage. Nous montrons que ce cadre general est va parfois a l'encontre des 
donnces linguistiques, et nous donnons quclques indications pour unc approche 
diffcrente basee sur la theorie de la demonstration, qui sur bien des points 
s'avre plus proche de la langue. Nous soulignons l'importance des operateurs 
tau et epsilon de Hilbert qui rendent respectivement compte de la qualification 
universelle et existentielle. En cffct ccs operateurs permettent de construire des 
des representations semantiques qui suivent la lange avec, en particulier des 
groupes nominaux quantificcs qui soicnt des termes individuels. Nous donnons 
aussi des principes pour definir des rgles de deduction qui correspondent aux 
quantificateurs generalises. 



1 Foreword: empirical data on quantification 



Although quantification in logic, in linguistics and in philosophy of language 
are much studied we think that the dominant model theoretic approach partly 
leaves out linguistic reality. [TUl E2 HH |3] 

Quantifiers are quite common in ordinary language and even more in special- 
ity languages like sciences, in particular mathematics... unfortunately for the 
linguists, the more difficult quantifiers are the most common in natural language. 
Indeed, as we shall see besides the classical quantifiers of mathematics for all V 
and there exists 3, natural language makes use of many other quantifiers, some 
of which are only implicitly stated. Consider for instance the plural definite 
determiner the it can be used as universal quantification as in The children fell 
asleep, and it can be used as most was in The Brits love Franc^\ which does not 
mean that the Brits all do, but more likely that a large part of them do, or even 
that a short majority does as in France may have elected Francois Hollande, but 
she really needs a Napoleon Bonaparte. Let us first say a linguistically oriented 
word on the usual quantifiers, namely for all and there exists. 

The existential quantifier is omnipresent in natural language. As soon as one 
speaks about something, this "something" is existentially quantified as in A man 
enters. He whistles .*- us ) The introduction of new referents, that are existentially 
quantified, and that are the beginning of anaphoric chains are an important 
par of the structure of a discourse. For instance, observe that the scope of the 
existential quantifier extends to the next sentence — this has been a reason to 
for introducing various dynamic logics. The behaviour of existential quantifiers 
actually lead to a formalism which is structured by existential quantification, 
namely Discourse Representation Theory, which construct logical formula in a 
way that better matches the linear progression of sentences. It is often opposed 
to compositional semantics, since DRT proceeds top down from larger units to 
the smallest, words, but there by now exist compositional formulations of DRT. 
So existential quantifiers are really present in discourse and conversation, and 
they contribute to the structure and coherence of a discourse. [6] 

The universal quantifier, as it is use in mathematics, is rather rare in ordinary 
conversation, as corpora study shows, an exception being the reference to a 
group that can be inferred from the context like in They all went to bed.. The 
negative formulation is more frequent no one, nothing etc. and if no one came 
it means that every one did not come. 

When speaking about natural language it is often assumed that the individ- 
uals that is being referred to by quantification is finite. This is not the case. 
Firstly we meets such expression in mathematical discussions, but also various 
cases: 

(1) He wrapped up by explaining the dark future for the Universe when all 
the stars go away. 

1 Unless otherwise stated our examples are from the web. When they are ours, e.g. for 
phrasing differently an example, we use the mark ( us '. 

2 Another problem might be that the quantity a quantifier refers to can be continuous, as 
in We drank all the wine. 
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(2) All atoms are made from the same bits, which are called subatomic par- 
ticles 

(3) Just about all sentences in the English language fall into ten patterns 
determined by the presence and functions of nouns, verbs, adjectives, 
and adverbs. 

(4) All ideas are welcome. 

The most that we discussed above is also quite common and also applies to 
infinite collection of objects: 

(5) In basic math, we're taught multiplication tables. We learn that most 
numbers are the answer to at least two different multiplication problems, 
some numbers are the answer to several, and then... 

(6) Any module of known f3 is weak. Most numbers have even f3 and most 
of them are not antisymmetric. 

(7) The number one reason why most people fail at dieting is simple: they 
can't stick to it. 

(8) Since most numbers are not prime, it would waste time to check every 
number. 

(9) ... thus, in the limit most numbers are not prime. 

The human processing of quantifiers is quite difficult, in particular their 
alternation. There are experiments comforting this viewpoint, see e.g. |15j . 
But, on the other hand, one finds rarely but sometimes Henkin quantifiers in 
natural language, like Every member of the lab knows a member of every village 
sports club. What do people mean when using such quantifiers that would 
require a lifetime verification? This is a mystery, we know that our study, as 
well as others, via a confronting the formal model with what we hear or with 
psycholinguistics experiments may give some hints on the complexity of the 
human processing. 

These few introductory word should have convinced the reader that quan- 
tification is an important and common phenomenon in natural langue, although 
mathematical logic ignores it (an important exception being Hilbert, as we shall 
see), and although natural language processing consider it as rare and irrelevant 
(an exception being categorial grammar). So we agree with semanticists: quan- 
tification is an important question connecting logic, linguistics and philosophy. 

2 Remarks on the two mathematised quantifiers, 
there exists and for all 

The two quantifiers that are used in mathematics, namely V and 3 on individuals 
received a proper logical description, both in syntactical terms with rules (since 
Aristotle) and in semantical terms with interpretations in models (since the 
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beginning of the XXth century) . An important result is the coincidence of the 
two notions for first order logic known as the completeness theorem, established 
by Godel in 1930. It says that a formula is true in every model (1) if and 
only if it is provable (2) — the part (2) implies (1), known as soundness is 
easier to proved by induction on the derivation. Other famous theorems are the 
compactness and Lwenheim-Skolem theorems. The first one says that if any 
finite subset of a set F of formulae is contradiction free (admits a model) , so is 
F, and the second ones say that a theory with an infinite model and a countable 
language there are models of every cardinality larger than countable. [7] 

Those results are well known — especially completeness — and there is by 
now not much proof theoretical study of quantification. But one should be aware 
that this correspondence only works for first-order classical logic — and not for 
higher order logic. For instance when second or higher order quantification is 
used, sub-Boolean algebras have to be considered. When other logics are con- 
sidered trickier structures are required to maintain this correspondence. Even 
in the well studied case of intuitionistic logic, much more complicated models 
have to used, e.g. (pre)sheaves of L-structures or Kripke models. [3] 

Even if we only consider first order quantification, the correspondence be- 
tween provability and truth is not as transparent as one may hope. Natural 
language provides good insights to perceive the distinction between the proof 
theoretical and the model theoretical approach. More formally stated, com- 
pleteness expresses the coincidence between 

• provability h Vx. P(x) inferred from a proof without a free hypothesis 
involving a free x (a proof with a generic x (the precise rule is given 
thereafter) 

• and truth in every model, that is &z X £iP(x) in every domain / 

An important remark is that the proof theoretical viewpoint is finite, while the 
second one involves two infinite sets: models are usually infinite^, and the family 
of all models is usually an infinite set as well. The proof theoretic approach 
makes use of a generic element. To prove something about any number, the 
proof starts with Let n be any number. . . which means that n as no other 
property than being a number — this methods goes back at least to Pythagora, 
(-580 -490), long before any logical formalisation. 

It should be observed that natural language makes a distinction between the 
two notion of quantification above: each and all rather concern the complete 
enumeration of the elements in a collection while every, any or bare plurals (e.g. 
Ducks lay eggs) rather concern the generic elements, law and universal rules. In 
natural language quantification is commonly formulated with a restriction to a 
given class: All the stars go away, and this is also the case in usual mathematics: 

Vx G N 3ui G N 3u 2 G N 3u 3 e N 3u 4 e N x = (ui) 2 { Ul f (u 2 f (u 3 ) 2 M 2 

3 In first order logic, one can say that the model has less than n elements, but there is not 
way to say that the model is finite since it requires second order logic 
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When the restriction is a restriction to a well identified class like (inanimate) 
things or human beings, there are quantifiers without restriction to specific 
classes like everyone or everything are possible too: they apply to a single 
property. 

The restriction to a given class in quantification disappeared from mathe- 
matical logic (but not from type theory) because of the following observation 
of Frege. Indeed, for philosophical reasons, he insisted in having a single one- 
sorted universe, and using implicitly a correspondence between a set M the 
corresponding predicate (M(x)), one is able to represent quantification with 
restriction to a given class: 



To present examples from the most studied quantifiers, we follow Arsitotle 
square of oppositions: 

• All As are Bs 

• Some As are Bs 

• No As are Bs 

• Not alls A are Bs 

In the original formulation, the expression A and B denote terms, which are 
much vaguer than properties or predicates — in the Middle- Age, the theory of 
suppositiones already contributed to make the notion of term neater. 

Existential quantification (Some As are Bs) is formulated with some, there 
is, a it can also be formulated without restriction to a class of individuals by 
someone, or something. The last two statements exhibit that natural language 
is finer grained, and that it makes a difference because of the focus between 
statement that usual logic consider as equivalent. 

(10) There 's a tramp sittin' on my doorstep 

(11) Some girls give me money 

(12) Something happened to me yesterday 

(13) Some politicians are crooks 

(14) ? Some crooks are politicians^^ 

Observe that the existential negative (the fourth corner of Aristotle square 
of oppositions) is NOT lexicalised (in English and other languages). Psycholin- 
guists know that they is harder to grasp, rather ambiguous, and in ordinary 
conversation an existential negative statement is often understood wrongly as 
universal negative statements (e.g. example [15] is rather often understood as 
[To]) . Also observe in our unambiguous rephrasing[TH]of[TSl the focus has changed 



VieM P(x) 



Vx (M(x) => P(x)) 



3x G M P(x) 



3x {M{x)kP{x)) 
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from the ones that can be funded to the ones that cannot, making the rephrased 
statement tougher. 

(15) Not Every Picture Tells a Story 

(16) No picture tells a story( us > 

(17) Some Students Do Not Participate In Group Experiments Or Projects. 

(18) Everyone is entitled to an opinion, but not every opinion is entitled to 
student government funding. 

(19) Everyone is entitled to an opinion, but some opinions are not entitled to 
student government funding. 

We already provided some examples of universal statements, here are some 
more, ranging on a potentially infinite domain: 

(20) Each star in the sky is an enormous glowing ball of gas. 

(21) All groups of stars are held together by gravitational forces. 

(22) Terence Tao, a Fields medalist, has published a paper that proves that 
every odd number greater than 1 is the sum of at most five primes. 

The universal negative statements are expressed either by no, and without 
restriction to a class by no one, nothing, Here are some examples: 

(23) Because no planet's orbit is perfectly circular, the distance of each varies 
over the course of its year. 

(24) Nothing's gonna change my world. 

(25) Porterfield went where no colleague had gone previously this season, re- 
alising three figures. 

From a proof theoretical viewpoint how do 3 and V work? They both have 
introduction and elimination rules: 

• Universal quantifier 

— The V introduction rule says as above that when a property has been 
established for an x which does not enjoy any particular property (i.e. 
is not free in any hypothesis), one can conclude that the property 
holds for all individuals: 

no free occurrence of x 
in any Hi 
H u ...,H n \- P{x) 

H 1 ,...,H n \-y X . P{x) 1 
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— The V elimination rule says that when a property has been established 
for all individuals it can be inferred from any particular individual: 

Hi,...,H n h Var. P(x) 
H 1 ,...,H n \-P(a) 

• Existential quantifier 

— The 3 introduction rule says that whenever a property P holds of an 
individual a one can infer that there exists an individual enjoying P 

H 1 ,...,H n \-P(a) 
H 1 ,...,H n \-3x. P(x) 

— The 3 elimination rule is trickier. It says that if assuming P(x) and 
nothing more about x we derived C which does not depend on x, 
and if we have a proof of 3x. P{x) we can conclude, without the 
hypothesis P(x) that C: 

no free occurrence of x 
in any Hi nor in C 
H[,...,H' p h3x. P(x) P(x),H u ...,H n \-C 

Hi,..., H p , Hi,..., H n h C 

An alternative way to deal with quantification is to use individual concepts 
that are properties holding of exactly one individual. This is the reason why 
in Discourse Representation Theory some people write John(x) while others 
write j: the former notation describes an individual concept John(S) which has 
the property of being true of exactly one individual, and this can be stated 
in second and first order logic: X is an individual concept whenever C(X) = 
(\/x\/y(X(x)AX(y) x — y))A3x. X{x) — there are variants in which the exis- 
tence is not demanded. Universal quantification can be phrased with individual 
concepts via second order quantification. Indeed, Mx. P(x) can be expressed as 
VX(C(X) => P*{X)) with P*{X) = 3X. {X{x)kP{x)), and, conversely, given 
a property Q of individual concepts, the quantification VX. (C(X) => Q(X)) 
corresponds to Mx. Q b (x) with Q b (x) = 3X. (C(X) AX(x) AQ(X)). The equiv- 
alence are established by formal proof in second order logic, with second order 
quantification, and by duality the same result holds for existential quantifica- 
tion. There is a variant according to which the non emptiness is not required, 
since it is possible to name individuals that do not exist in this case one from of 
quantification is stronger than the other. One may wonder why using individual 
concepts and therefore use second order logic. A reason is that it is closer to 
the notion of term of Ancient and Medieval logic: both Socrates and human 
beings are terms. A more concrete reason is that, in common possible worlds 
semantics with rigid designators, it is impossible to interpret / do not believe 
that Tully is Cicero. [8] 
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Proof theoretic approaches naturally lead to rules of refutation. How does 
one refute a universal statement VxP(x)? Of course it amounts to try to prove 
the negation, that is an existential statement 3x^P(x), but there are different 
ways to do so. Consider the following example: 

(26) [The AKC notes] that any dog may bite. 

(27) No, Rex would never biteS us ' 

(28) Basset hounds do not biteS us ^ 

The difference is that the first answer picks up an element in the relevant 
model, while the second answer remains with generic elements. This is related 
to the Avicenian idea that a property of a term (individual or not) is always 
asserted for the term as part of a class: it is more related to type theory than 
to the Fregean view of a single universe. 

3 Hilbert's operators: l, e, t, . . . 

After the quantifier the one and unique individual such that P . . . introduced 
by Russell for definite description, Hilbert (with Ackerman and Bernays) inten- 
sively used generic elements for quantification, the study of which culminated 
in the second volume of Grundlagen der Mathematik [S]. It should be stressed 
that these operators are introduced and describe with natural language exam- 
ples, which is not so common in Hilbert's writings. We shall first present the e 
operator which recently lead to important work in linguistics in particular with 
von Heusinger work. [2J [TCI E] 

The first step due to Russell was to denote by i x . F the unique individual 
enjoying the property F in a definite description like the first sentence and to 
remain undetermined when existence and uniqueness do not hold. |12j 

(29) The present president of France was born in RouenS us ^ 
(existence and uniqueness hold) 

(30) The present king of France was born in PauS us ^ 
(existence fails) 

(31) The present minister was born in BarcelonaS us ^ 
(uniqueness fails) 

From this idea, Hilbert introduced an individual existential term defined 
from a formula: given a formula F(x) with a free variable x one defines the 
term e x . F in which the occurrences of x in F are bound (this is the original 
notation, nowadays this term is often written as e x . F). Whenever an element, 
say a, enjoys F, then the epsilon term e x . F enjoys F. 

Dually, Hilbert introduced a universal generic element t x . F, which corre- 
sponds to the generic elements used in mathematical proofs: if t x . F enjoys the 
property F then every individual does. 

The evident deduction rules for r and e are as follows: 
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• From A(x) with x generic in the proof (no free occurrence of x in any 
hypothesis), infer A(t x . A(x)) 

• From B{c) infer B(e x . B(x)). 

The additional rules can be found by duality: 

• From A{x) with x generic in the proof (no free occurrence of x in any 
hypothesis), infer A(e x . -*A(x)) 

• From B(c) infer B(t x . -iB(x)). 

Hence we have F(t x . F(x)) = Vx.F(x) and F(e x . F(x)) = 3x. F(x) and 
because of negation, one only of these operator is needed, usually the e operator 
and the logic is known as the epsilon calculus. . 

Hilbcrt turned these symbols into a mathematically quite nice object, since 
it allows to fully describe quantification and with simple rules. The first and 
second epsilon theorem basically say that this is an alternative formulation of 
first order logic. 

First epsilon theorem When inferring a formula C without e symbol nor 
quantifier from formulae T not involving the e symbol nor quantifiers the 
derivation can be done within quantifier free predicate calculus. 

Second epsilon theorem When inferring a formula C without e symbol from 
formulae T not involving the e symbol the derivation can be done within 
predicate calculus. 

This way, it provided the first correct proof of Hcrbrand theorem (much 
before mistakes where found and solved by Goldfarb) and a proof of the consis- 
tence of Peano arithmetic (at the same time as Gentzen did). The extension of 
the cut elimination patterns to these operators does not seem too complicated 
and people already worked on them, by [5] 

In Hilbert's book these operators are explained with natural language exam- 
ples, but a very important linguistic property is not stated: the e x F as a type 
(both in the intuitive and in the formal sense) of a noun phrase, and is meant to 
be the argument of a predicate (for instance the subject of a verb), thus being 
a suppositio in the medieval sense. [TJ [7] 

Nowadays there has been a renewed interest in these epsilon formulation 
of quantification, in particular by von Heusinger. He uses a variant of the 
epsilon for definite descriptions, leaving out the uniqueness of the iota operator 
of Russell, one reason being that the context often determines a unique object, 
the most salient one. As say it is a variant because it is not clear whether one 
still has the equivalence with ordinary existential quantification: he constructs 
an epsilon term whenever there is an expression like a man or the man but it is 
not clear how does one asserts that man(e x . man(x)). The distinction between 
e and 77 is that the former selects the most salient, while the later selects a new 
one. 
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4 Generalised quantifiers 



The abundant literature on generalised quantifiers takes place in a rather con- 
ventional setting, along the lines drawn by in particular by Frege. It is assumed 
that generalised quantifiers like thirty per cent or many or most are functions of 
two predicates. Indeed, the aforementioned Fregean trick to handle restricted 
quantification in a one-sorted logic (Vx € M P(x) = Vx (M(x) P(x))) 
does not apply: as an easy exercise shows, the two sentences below are not 
equivalent: 

(32) Most students go out on Thursday nights. 

(33) For most individuals, if they are students then they go out on Thursday 
nights. 

They are classified according to their behaviour with respect to the two pred- 
icates: covariant, contravariant, and properties wrt. existential quantification 
(weakness). Some rules are even provided, but in fine they amount to count 
how many elements are in the class corresponding to the restriction, in the 
main predicate itself and in their intersection. The proof rules that are given, 
at some point, always refers to the intended model, and at this point one can 
see that they are not proof rules in the ordinary sense. This sometimes leads 
to inaccurate results. For instance, most is defined as the majority of although 
their actual usage is quite different: one can assert the first sentence but not 
the second one in: 

(34) The majority of French electors voted Hollande in 2012 

(35) Most French electors voted Hollande in 2012 

Indeed, most is a vague quantifier, and as many starts beyond a flexible 
and context dependent percentage. The fact that the cardinality approach, the 
usual one, is wrong is proved by the following examples: 

(36) Most numbers are not prime. 

(37) Most people have children, (no one can tell whether there is a finite 
number of them) 

Indeed, what is clear is that a measure is need but counting is not the only 
way to measure a set. Otherwise the first sentence would be false, while it 
appears in an advanced book on number theory, and while anyone knowing 
some basic maths agrees with such a sentence. 

Another problematic point is that the usual vision of (generalised) quanti- 
fiers is to consider them as functions of several predicates. The mathematical 
quantifier are functions of a single predicate: given a predicate P(x) one can 
form Vx. P(x). General quantifiers depends on several predicates, usuallly the 
restriction to a given class and the main predicate. The quantifier Q (some, all, 
most,...) in a sentence like Q children sleep is supposed, in the common view, to 
be a function that applies to the predicates children (restriction class) and sleep. 
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Hence Q children is meaningless in this traditional view. This goes against the 
syntactic and cognitive structure of language. If a speaker starts a sentence 
with Q children before he came to the verb, the hearer already has in mind 
an image of what is Q children with respect to children. This drawback of the 
common approach is avoided when using Hilbert operators or choice functions. 
Our argument can be considered as a formal developement of Geach ideas on 
terms and predicates [4] 

5 Towards rules for some generalised quantifiers 

With this idea in mind, we started to study quantification from two converg- 
ing viewpoints. One viewpoint is the interface between syntax and semantics, 
i.e. how do we map the syntactic structure of a sentence to a logical formula 
that represents the semantics of the analysed sentence. The other viewpoint is 
how one handles such formulae: how do we prove and refute them, how do we 
interpret them? 

Regarding the first viewpoint, we advocate for a typed version of Hilbert 's 
generic elements. Consider the following sentences: 

(38) Most dogs bite. 

(39) Most of the students that passed algebra passed logic. 

In those sentences the first restriction class, namely dogs can be viewed as 
a type, while in the second example the restriction class students that passed 
algebra is a formula with a unique free variable. The operator we use is a 
constant in a typed lambda calculus, and actually we have two distinct versions 
of the operator introducing a most generic element. A first one maps a type 
to an element in this type (hence the type of the operator is false, but there is 
no problem to have a constant with this type) and the second ones takes as its 
argument a predicate over the type student and return a student and adds a 
presupposition that this student has passed algebra. We propose to do so for all 
generalise quantifiers, that is to have generic elements corresponding to them, 
computed from typed Hilbert's style operators. This is a way to compute their 
semantic representations, but it does not say how they are interpreted (we only 
provided some hints), not how one argues with such quantifiers, [llj 

On this later point, regarding the the majority of quantifier, we thought 
about what could be rules and refutations for such a quantifier. There are at 
least two ways to refute that the majority of (meaning at least 50%) the A have 
the property P: 

• Only the minority of the A has the property P 

• There is another property Q which holds for the majority of the A, with 
no A satisfying both P and Q. 

To add a generalised quantifier to a proof system one should actually intro- 
duce a pair of dual quantifiers, a variant V* of V and a variant 3* of 3 Then one 
has to decide one of the following two possibilities, the first one being unlikely: 
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• V*x. A(x) implies Vx. A{x) and so 3x. A(x) implies 3*x. A{x) 

• 3*x. A(x) implies 3x. A(x) and so Vx. A(x) implies V*x. A{x) 

In both the cases, one of the two new quantifiers is obtained by extending an 
existing rule, and the other one oby restricting an existing rule. Although there 
cannot be complete set of rules, it is likely to find family of rules and refutation 
techniques that would model a large part of those quantifier's behaviour. 

6 Conclusion 

After some critics of the common set theoretic approach, we gave some hints 
for a proof theoretical approach to quantification. One idea is to use typed 
versions of Hilbert operator for computing the semantic representations. The 
other idea is to use principles on proofs and cu-climination to design the rules 
of generalised quantifiers. 
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